Breaking Value Symmetry* 



Toby Walsh 

NICTA and UNSW 
Sydney, Australia 
tw@cse.unsw.edu.au 

1 Introduction 

One common type of symmetry is when values are symmetric. For example, if we 
are assigning colours (values) to nodes (variables) in a graph colouring problem then 
we can uniformly interchange the colours throughout a colouring. For a problem with 
value symmetries, all symmetric solutions can be eliminated in polynomial time 11121 . 
However, as we show here, both static and dynamic methods to deal with symmetry 
have computational limitations. With static methods, pruning all symmetric values is 
NP-hard in general. With dynamic methods, we can take exponential time on problems 
which static methods solve without search. 

2 Background 

A constraint satisfaction problem consists of a set of variables, each with a domain of 
values, and a set of constraints specifying allowed combinations of values for given 
subsets of variables. A solution is an assignment of values to variables satisfying the 
constraints. Variables take one value from a given finite set. Symmetry occurs in many 
constraint satisfaction problems. A value symmetry is a permutation of the values that 
preserves solutions. More formally, a value symmetry is a bijective mapping a on the 
values such that if X\ = di, .. . , X n = d n is a solution then X\ = <r(di), . .. , X n = 
(j(d n ) is also. A variable symmetry, on the other hand, is a permutation of the variables 
that preserves solutions. More formally, a variable symmetry is a bijective mapping 
a on the indices of variables such that if X\ = d\ , . . . , X n — d n is a solution then 
^o-(i) = d\, . . . , X a r n \ = d n is also. Symmetries are problematic as they increase the 
size of the search space. For instance, if we have m interchangeable values, symmetry 
increases the size of the search space by a factor of m\. 

Many constraint solvers explore the space of partial assignments enforcing some 
local consistency. We consider four local consistencies for finite domain variables Given 
a constraint C, a support is assignment to each variable of a value in its domain which 
satisfies C. A constraint is generalized arc consistent (GAC) iff for each variable, every 
value in its domain belongs to a support. A set of constraints is GAC iff each constraint 
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is GAC. On binary constraints, GAC is simply called arc consistency (AC). A set of 
binary constraints is singleton arc consistent (SAC) iff we can assign every variable with 
each value in its domain and make the resulting problem arc consistent (AC). Finally, 
a set of binary constraint is k-consistent iff each k — 1 assignment can be consistently 
extended to a fcth variable, and is strongly k-consistent iff it is j-consistency for all 
j < k. We will compare local consistency properties applied to sets of constraints, C\ 
and c 2 which are logically equivalent. As in [4|, a local consistency property <P on ci is 
as strong as on c 2 iff, given any domains, if <P holds on C\ then ^ holds on c 2 ; $ on 
ci is stronger than & on c 2 iff ^ on c\ is as strong as ^ on c 2 but not vice versa; <P on 
Ci is equivalent to ^ on c 2 iff <P on c\ is as strong as 9 on c 2 and vice versa. 



3 Static methods 

One simple and common mechanism to deal with symmetry is to add constraints which 
eliminate symmetric solutions [5 1. Suppose we have a set £ of value symmetries. Based 
on J6), we can eliminate all symmetric solutions by posting a global constraint which 
ensures that the solution is ordered lexicographically before any of its symmetries. More 
precisely, we post the global constraint ValSymBreak(Z', [X\, . .. , X n ]) which en- 
sures [Xi, . . . , X n ] <i ox [a(Xi), . . . , a(X„)} for all a £ £ where X\ to X n is a 
fixed ordering on the variables. Unfortunately, pruning all values from such a symme- 
try breaking constraint is NP-hard. 

Theorem 1 Deciding //"ValSymBreak(Z', [Xi,... ,X n ]) is GAC is NP-complete, 
even when \£\ is linearly bounded. 

Proof: Membership in NP follows by giving a support for every possible assignment. 
To prove it is NP-hard, we give a reduction from a 3-SAT problem in N Boolean vari- 
ables and M clauses. We construct a CSP with N+M+l variables over 4N+2 possible 
values. The first AN values partition into 2N interchangeable pairs. The values Ai — 3 
and Ai — 2 are interchangeable, as are 4i — 1 and 4i for 1 < i < N. The values 4i — 3 
and 4i — 2 represent Xi being true, whilst the values 4z — 1 and Ai represent Xi being 
false. The final two values, AN + 1 and 4A^ + 2 are not interchangeable. The first N 
CSP variables represent a "truth assignment". We have Xi £ {Ai — 3, Ai — 2, Ai — 1, Ai} 
for 1 < i < N. The next M CSP variables ensure at least one literal in each clause 
is true. For example, if the ith clause is Xj V ->Xk V xi, then the domain of X^ + i is 
{Aj - 3, Aj -2,4k- 1, Ak, AI - 3, AI - 2}. The final variable X N+M +i is a "switch" 
and has the domain {4^+1, 4^ + 2}. Note that all variables have symmetric domains. 

We have two sets of constraints. First, we have the constraints odd(Xjv+M+i) — * 
odd(Xi) for 1 < i < N and odd(XM+M+i) ~* even(X]y + j) for 1 < j < M. Second, 
we have the constraints odd(XN+M+x) — * PHP(N, N + 1) and even(XN+M+i) — > 
PHP(N, N) where PHP(i, j) is a pigeonhole constraint which holds iff the variables 
Xi to Xi take j distinct values. Note that PHP(N,N + 1) is unsatisfiable and that 
PHP(N, N) is satisfiable. Thus, the constructed CSP is unsatisfiable if X^+m+i — 
AN 4- 1 and satisfiable if Ajv+m+i = 4A^ + 2. Note that if we take any solution of the 
CSP and permute any of the interchangeable values, we still have a solution. Thus, if 



E is the set of symmetries induced by these interchangeable values, it is permissible to 
add ValSymBreak(Z', [Xi, X n ]) to this CSP. 

Suppose our branching heuristic sets the switch variable Xn+m+i to 4iV + 1. En- 
forcing AC on the binary constraints prunes the domains of Xi to {4i — 3, 4i — 1} for 
1 < i < N. Similarly, the domain of X^ + i is reduced to {4j — 2, 4k, 41 — 2}. Consider 
now finding a support for ValS YMBreak given this particular subproblem. X^ + i can 
only take the value 4j — 2 if Xj had previously been assigned 4j — 3. In other words, 
X^+i can only take the value 4j — 2 if Xj is set to true in the "truth assignment". Sim- 
ilarly, X^+i can only take the value 4k if Xk had previously been assigned 4k — 1. In 
other words, Xjv+i can only take the value 4k if Xk is set to false in the "truth assign- 
ment". Finally, X^+i can only take the value 41 — 2 if Xj had previously been assigned 
41 — 3. In other words, Xjv+i can only take the value 4Z — 2 if xi is set to true in the 
"truth assignment". Thus, at least one of the literals in the ith clause must have been set 
to true in the "truth assignment". Hence, there is a support for ValSymBreak iff the 
original 3-SAT problem is satisfiable. By Theorem 3, \S\ can be linearly bound. □ 

This is a somewhat surprising result. Whilst it is polynomial to eliminate all sym- 
metric solutions either statically (2) or dynamically (T), it is NP-hard to lookahead and 
prune all symmetric values. Equivalently, whilst we can avoid visiting symmetric leaves 
of the search tree in polynomial time, avoiding symmetric subtrees is NP-hard. 

4 Dynamic methods 

An alternative to static methods which add constraints to eliminate symmetric solutions 
are dynamic methods which modify the search procedure to ignore symmetric branches. 
For example, with value symmetries, the GE-tree method can dynamically eliminate all 
symmetric leaves in a backtracking search procedure in 0(n 4 log(n)) time (TJ. How- 
ever, as we show now, such dynamic methods may not prune all symmetric subtrees 
which static methods can do. Suppose we are at a particular node in the search tree 
explored by the GE-tree method. Consider the current and all past variables seen so far. 
The GE-tree method can be seen as performing forward checking on a static symmetry 
breaking constraint over this set of variables. This prunes symmetric assignments from 
the domain of the next branching variable. Unlike static methods, the GE-tree method 
does not prune deeper variables. By comparison, static symmetry breaking constraints 
can prune deeper variables, resulting in interactions with the problem constraints and 
additional domain prunings. For this reason, static symmetry breaking methods can 
solve certain problems exponentially quicker than dynamic methods. 

Theorem 2 There exists a model of the pigeonhole problem with n variables and n + 
1 interchangeable values such that, given any variable and value ordering, the GE- 
tree method explores 0(2") branches, but which static symmetry breaking methods can 
solve in just 0(n 2 ) time. 

Proof: The n + 1 constraints in the CSP are V"=i = 3 f° r 1 < J < n- + 1, and 
the domains areX^ 6 {1, . . . , n + 1} for 1 < i < n. The problem is unsatisfiable by 
a simple pigeonhole argument. Any of the static methods for breaking value symmetry 



presented later in this paper will prune n+1 from every domain in 0(n 2 ) time. Enforc- 
ing GAC on the constraint V"=i Xi = n + 1 then proves unsatisfiability. On the other 
hand, the GE-tree method irrespective of the variable and value ordering, will only ter- 
minate each branch when n — 1 variables have been assigned (and the last variable is 
forced). A simple calculation shows that the size of the GE-tree more than doubles as 
we increase n by 1. Hence we will visit 0(2 n ) branches before declaring the problem 
is unsatisfiable. □ 

This theoretical result supports the experimental results in [2| showing that static 
methods for breaking value symmetry can outperform dynamic methods. Given the 
intractability of pruning all symmetric values in general, we focus in the rest of the paper 
on a common and useful type of value symmetry where symmetry breaking methods 
have been proposed that take polynomial time: we will suppose that values are ordered 
into partitions, and values within each partition are uniformly interchangeable. 

5 Generator symmetries 

One way to propagate ValSymBreak is to decompose it into individual lexicograph- 
ical ordering constraints, [Xx, . . . , X n ] <\ cx [a(Xx), ■ ■ ■ , cr(X n )] and use one of the 
propagators proposed in [7 1 or [8 1. Even if we ignore the fact that such a decomposition 
may hinder propagation (see Theorem 2 in (8)), we have to cope with E, the set of sym- 
metries being exponentially large in general. For instance, if we have m interchangeable 
values, then E contains ml symmetries. To deal with large number of symmetries, Aloul 
et al. suggest breaking only those symmetries corresponding to generators of the group 
|9|. Consider the generators which interchange adjacent values within each partition. 
If the m values partition into k classes of interchangeable values, there are just m — k 
such generators. Breaking just these symmetries eliminates all symmetric solutions. 

Theorem 3 If E is a set of symmetries induced by interchangeable values, and E g 
is the set of generators corresponding to interchanging adjacent values then posting 
ValSymBreak(Z' s , [X\, . . . , X n ]) eliminates all symmetric solutions. 

Proof: Assume ValSymBreak(Z' £ ,, [Xi,. . . , X n ]). Consider any two interchange- 
able values, j and k where j < k, Let aj G E g be the symmetry which swaps just 
j with j + 1. To ensure [Xx, ■ ■■ , X n ] <i cx [dj(Xi), . .. , <jj(X n )], j must occur be- 
fore j + 1 in Xi to X n . By transitivity, j therefore occurs before k. Thus, for the 
symmetry a' which swaps just j with k, [Xi,...,X n ] <i cx [a'(Xi),... ,a'(X n )]. 
Consider now any symmetry a € E. The proof proceeds by contradiction. Suppose 
[X\, ... , X n ] >icx [cr(Xi), . . . , a(X n )]. Then there exists some j with Xj > a{Xj) 
and Xi = a(Xi) for all i < j. Consider the symmetry a 1 which swaps just Xj with 
a(Xj). As argued before, [X±, . . . , X n ] <\ cx [a'(Xx), . . . , a'(X n )]. But this contra- 
dicts [Xx, . . . , X n ] >i cx [a(Xi), . . . , a(X n )) as a and a' act identically on the first j 
variables in X\ to X n . Hence, [X\, . . . , X n ] <i ox [<j(Xi), . . . , a(X n )}. □ 



Not surprisingly, reducing the number of symmetry breaking constraints to linear 
comes at a cost. We may not prune all symmetric values. 



Theorem 4 If £ is a set of symmetries induced by interchangeable values, and S g 
is the set of generators corresponding to interchanging adjacent values then GAC 
on ValSymBrEAK(.E', [Xi,... , X n ]) is stronger than GAC on [Xi,...,X n ] <i cx 
[a(Xx ),..., a(X n )} for all a G E a . 

Proof: Clearly it is at least as strong. To show it is stronger, suppose all values are 
interchangeable with each other. Consider X\ = 1, X2 £ {1, 2}, X3 G {1, 3}, X4 G 
{1, 4} and X 5 = 5. Then enforcing GAC on ValSymBreak(Z', [X l7 . . . , X 5 ]) prunes 
1 from X 2 , X 3 and X 4 . However, [Xx,..., X 5 ] < lcx [a(Xx), cr(X 5 )} is GAC for 
all a € S g . . □ 

Finally, it is not hard to see that there are other sets of generators for the symmetries 
induced by interchangeable values which do not necessarily eliminate all symmetric 
solutions (e.g. with the generators which interchange the value 1 with any i, we do not 
eliminate either the assignment Xi = 1, X 2 — 2 or the symmetric assignment Xi = 1, 
•V :! :'..). 

6 Puget's decomposition 

With value symmetries, a second method that eliminates all symmetric solutions is a de- 
composition due to [2|. Consider a surjection problem (where each value is used at least 
once) with interchangeable values. We can channel into dual variables, Zj which record 
the first index using the value j by posting the binary constraints: X{ = j — » Zj < i and 
Zj = i — > Xi = j for all 1 < i < n, 1 < j < m. We can then eliminate all symmetric 
solutions by insisting that interchangeable values first occur in some given order. That 
is, we place strict ordering constraints on the Zk within each class of interchangeable 
values. Puget notes that any problem can be made into a surjection by introducing m 
additional new variables, X n +i to X n + m where X n +i = i. These variables ensure that 
each value is used at least once. In fact, we don't need additional variables. It is enough 
to ensure that each Zj has a dummy value, which means that j is not assigned, and to 
order (dummy) values appropriately. Unfortunately, Puget's decomposition into binary 
constraints hinders propagation. 

Theorem 5 If U is a set of symmetries induced by interchangeable values, then GAC 
on ValSymBreak(17, [Xi, . . . , X n \) is stronger than AC on Puget's decomposition 
into binary constraints. 

Proof: It is clearly at least as strong. To show it is stronger, suppose all values are 
interchangeable with each other. Consider X\ = 1, X2 £ {1, 2}, X3 G {1, 3}, X4 G 
{3,4}, X 5 =2,X 6 = 3, X 7 = 4, Z x = 1, Z 2 G {2,5}, Z 3 G {3,4,6}, and Z 4 G 
{4, 7}. Then all Puget's symmetry breaking constraints are AC. However, enforcing 
GAC on ValSymBreak(Z', [Xx, X 5 }) will prune 1 from X 2 . □ 

If all values are interchangeable with each other, we only need to enforce a slightly 
stronger level of local consistency to prune all symmetric values. More precisely, en- 
forcing singleton arc consistency on Puget's binary decomposition will prune all sym- 
metric values. 



Theorem 6 If all values are interchangeable and £ is the set of symmetries induced by 
this then GAC on ValSymBreak(I7, [X\,. . . , X n \) is equivalent to SAC on Puget's 
decomposition into binary constraints. 

Proof: Suppose Puget's encoding is AC. We will show that there is at least one support 
for ValSymBreak. We assign Z\ to Z m in turn, giving each the smallest remaining 
value in their domain, and enforcing AC on the encoding after each assignment. This 
will construct a support without the need for backtracking. At each choice point, we 
ensure that a new value is used as soon as possible, thus giving us the most freedom 
to use values in the future. Suppose now that Puget's encoding is SAC. Then, by the 
definition of SAC, we can assign any variable with any value in its domains and be 
sure that the problem can be made AC without a domain wipeout. But if the problem 
can be made AC, it has support. Thus every value in every domain has support. Hence 
enforcing SAC on Puget's decomposition ensures that ValSymBreak is GAC. □ 

We might wonder if singleton arc-consistency is enough for arbitrary value symme- 
tries. That is, does enforcing SAC on Puget's encoding prune all symmetric values? We 
can prove that no fixed level of local consistency is sufficient. Given the intractability 
of pruning all symmetric values in general, this result is not surprising. 

Theorem 7 For any given k, there exists a value symmetry and domains for which 
Puget's encoding is strongly k-consistent but is not k + 1-consistent. 

Proof: We construct a CSP problem with 2k + 1 variables over 2(fc + 1) possible 
values. The 2(k + 1) values partition into k + 1 pairs which are interchangeable. More 
precisely, the values i and k + 1 + i are interchangeable for 1 < i < k + 1. The first k 
variables of the CSP have k + 1 values between them (hence, one value is not taken). 
More precisely, 6 {i, i + 1} for 1 < i < k. The remaining k + 1 variables then 
take the other k + 1 values. More precisely, Xk+i = k + 1 + i for 1 < i < k + 1. The 
values 1 to k + 1 need to be used by the first k variables, X\ to Xk so that the last k + 1 
variables, Xk+i to X 2 (k+i) can use the values k + 2 to 2(k + 1). But this is impossible 
by a pigeonhole argument. Puget's encoding of this is strongly fc-consistent. since any 
assignment of k — 1 or less variables can be extended to an additional variable. On the 
other hand, enforcing k + 1 -consistency will discover that the CSP has no solution. □ 
Finally, we compare this method with the previous method based on breaking the 
symmetries corresponding to the generators which interchange adjacent values. 

Theorem 8 If E is a set of symmetries induced by interchangeable values, and E g is 
the set of generators interchanging adjacent values then AC on Puget's decomposition 
for £ is stronger than GAC on [X\,. . . , X n ] <i ox [a(Xi), . . . , a(X n )]for all a G E g . 

Proof: Suppose Puget's decomposition is AC. Consider the symmetry a which inter- 
changes j with j + 1. Consider any variable and any value in its domain. We show how 
to construct a support for this assignment. We assign every other variable with j if it 
is in its domain, otherwise any value other than j + 1 and failing this, j + 1. Suppose 
this is not a support for [X\, . . . , X n ] <i ox [a(Xi), . .. , a(X n )]. This means that in the 
sequence from X\ to X n , we had to use the value j + 1 before the value j. However, as 
Puget's decomposition is AC, there is a value in the domain of Zj smaller than Zj + \. 



This contradicts j + 1 having to be used before j. Hence, this must be a support. Thus 
[Xi, ... , X n ] <icx [cr(Xi), . . . , (t(X„)] is GAC for all a G S g . To show that AC on 
Puget's decomposition is stronger consider again the example used in the proof of The- 
orem [4] The lexicographical ordering constraint for each generator a £ E g is GAC 
without any domain pruning. However, enforcing AC on Puget's decomposition prunes 
1 fromX 2 , X 3 andXi. □ 

7 Value precedence 

A third method to break symmetry due to interchangeable values uses the global prece- 
dence constraint 0. PRECEDENCE ( \X\ , ... , X n ] ) holds iff min{i | Xi = j V i = 
n + 1} < min{i \Xi = kVi = n + 2} for all j < k. That is, the first time we use 
j is before the first time we use k for all j < k. Posting such a constraint eliminates 
all symmetric solutions due to interchangeable values. In [10], a GAC propagator for 
such a precedence constraint is given which takes 0(nm) time. It is not hard to show 
that Precedence ( [Xi, . . . , X n ]) is equivalent to ValSymBreak(Z', [X%, . . . , X n ]) 
where £ is the set of symmetries induced by interchangeable values. Hence, enforc- 
ing GAC on such a precedence constraint prunes all symmetric values in polynomial 
time. Precedence constraints can also be defined when values partition into several in- 
terchangeable classes; we just insist that values within each class first occur in a fixed 
order. In [ 10 1, a propagator for such a precedence constraint is proposed which takes 
0(n Y[i Trii) time where rtii is the size of the ith class of interchangeable values. This is 
only polynomial if we can bound the number of classes of interchangeable values. This 
complexity is now not so surprising. We have shown that pruning all symmetric values 
is NP-hard when the number of classes of interchangeable values is unbounded. 

8 Related work 

Puget proved that symmetric solutions can be eliminated by the addition of suitable 
constraints [5 1. Crawford et al. presented the first general method for constructing vari- 
able symmetry breaking constraints |6|. Petrie and Smith adapted this method to value 
symmetries by posting a suitable lexicographical ordering constraint for each value 
symmetry [13]. Puget and Walsh independently proposed propagators for such symme- 
try breaking constraints [78]. To deal with the exponential number of such constraints, 
Puget proposed a global propagator which does forward checking in polynomial time 
|7| . To eliminate symmetric solutions due to interchangeable values, Law and Lee for- 
mally defined value precedence and proposed a specialized propagator for a pair of 
interchangeable values [3|. Walsh extended this to a propagator for any number of in- 
terchangeable values ifTOl . An alternative way to break value symmetry statically is 
to convert it into a variable symmetry by channelling into a dual viewpoint and using 
lexicographical ordering constraints on this dual view II 141 1 21 . A number of dynamic 
methods have been proposed to deal with value symmetry. Van Hentenryck et al. gave a 
labelling schema for eliminating all symmetric solutions due to interchangeable values 
[15]. Inspired by this method, Roney-Dougal et al. gave a polynomial method to con- 
struct a GE-tree, a search tree without value symmetry (TJ. Finally, Sellmann and van 



Hentenryck gave a 0(nd 3 5 + n 2 d 2 ) dominance detection algorithm for eliminating all 
symmetric solutions when both variables and values are interchangeable [ 16 1. 

9 Conclusion 

Value symmetries can be broken either statically (by adding constraints to prune sym- 
metric solutions) or dynamically (by modifying the search procedure to avoid sym- 
metric branches). We have shown that both approaches have computational limitations. 
With static methods, we can eliminate all symmetric solutions in polynomial time but 
pruning all symmetric values is NP-hard in general (or equivalently, we can avoid visit- 
ing symmetric leaves of the search tree in polynomial time but avoiding symmetric sub- 
trees is NP-hard). With dynamic methods, we typically only perform forward checking 
and can take exponential time on problems which static methods solve without search. 
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